Exploring Linguistic Features for Named Entity Disambiguation

نویسندگان

  • Shuangshuang Zhou
  • Canasai Kruengkrai
  • Naoaki Okazaki
  • Kentaro Inui
چکیده

Resolving named entities is important for a number of natural language processing applications. However, a named entity has multiple name variations while different entities could share the same surface. State-of-the-art systems are based on a global resolution method and mostly adopt link-based features that leverage relationships of co-occurring entities in the knowledge. We found that linguistic features can also significantly affect disambiguation. In this work, we try to explore important linguistic features from context, which could be the fundamental part of the combination of global resolution method and effective features. Therefore, we study and compare the effects of linguistic features in a comprehensive way. Moreover, we found effective linguistic features according to the experiment results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploring Entity Relations for Named Entity Disambiguation

Named entity disambiguation is the task of linking an entity mention in a text to the correct real-world referent predefined in a knowledge base, and is a crucial subtask in many areas like information retrieval or topic detection and tracking. Named entity disambiguation is challenging because entity mentions can be ambiguous and an entity can be referenced by different surface forms. We prese...

متن کامل

CU-COMSEM: Exploring Rich Features for Unsupervised Web Personal Name Disambiguation

The increasing number of web sources is exacerbating the named-entity ambiguity problem. This paper explores the use of various token-based and phrase-based features in unsupervised clustering of web pages containing personal names. From these experiments, we find that the use of rich features can significantly improve the disambiguation performance for web personal names.

متن کامل

U-AIDA: a customizable system for named entity recognition, classification, and disambiguation

Recognizing and disambiguating entities such as people, organizations, events or places in natural language text are essential steps for many linguistic tasks such as information extraction and text categorization. A variety of named entity disambiguation methods have been proposed, but most of them focus on Wikipedia as a sole knowledge resource. This focus does not fit all application scenari...

متن کامل

J-NERD: Joint Named Entity Recognition and Disambiguation with Rich Linguistic Features

Methods for Named Entity Recognition and Disambiguation (NERD) perform NER and NED in two separate stages. Therefore, NED may be penalized with respect to precision by NER false positives, and suffers in recall from NER false negatives. Conversely, NED does not fully exploit information computed by NER such as types of mentions. This paper presents J-NERD, a new approach to perform NER and NED ...

متن کامل

Features for Web Person Disambiguation

Entity disambiguation resolves the many to many correspondence between mentions of entities in text and unique real-world entities. Our entity disambiguation uses language-independent entity context to agglomeratively resolve mentions with similar names to unique entities. This paper describes our automatic entity disambiguation capability and assesses its performance on the second Web People S...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Int. J. Comput. Linguistics Appl.

دوره 5  شماره 

صفحات  -

تاریخ انتشار 2014